
SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: ForschuVigszentrixm Juelich GmbH 

(B) ADDRESS: Post^ach 1913 

(C) CITY: Juelich 

(E) COUNTRY: GEI 

(F) ZIP CODE: 5242! 

(ii) TITLE OF INVENTION: M^HOD FOR MICROBIAL PRODUCTION OF AMINO ACIDS 

Of\tHE ASPARTATE AND/OR GLUTAMATE FAMILY 

(iii) NUMBER OF SEQUENCES: 2^ 

(iv) COMPUTER -READABLE FORM: 

(A) MEDIUM TYPE: Floppi^ disk 

(B) COMPUTER: IBM PC coAipatible 

(C) OPERATING SYSTEM: PQ-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 (EPA) 



(2) SEQ ID NO: 1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3728 Base Pairs 

(B) Type: Nucleotide 

(C) Strandedness : Single strc 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: Genomic DNA 



(xi) SEQ ID NO: 1; 




CGCAACCGTG 


CTTGAAGTCG 


TGCAGGTCAG 


AACAAAAACC 


GATGTTTGAT 


TGGGGGAATC 


CTATCACCCT 


TGGCGGTCTC 


TTGTTGAAAG 


CTTCAACGCT 


TCCAGCATTC 


AAAAAGATCT 


GTGCTTTCCG 


TGCAGCACTC 


GAAACCGGTG 


ATCGGGGATC 


ATTCCACCGC 


TCTTTTGCTT 


CACCAGTCAA 


GGCGTACCTG 


GACATCGATG 


CAGATGCCAT 


TTACCCGGGA 


TACGGCTTCC 



GGGAGTGTTG 
GGGGGTTACG 
GAATAATTAC 
TGGTAGCAAA 
CAGCCACGGT 
CTGAAGCTGT 
AAATTATCGG 
TGTCTGAAAA 



ICCCGAAAACA 
A^ACTAGGAC 
TC^AGTGTCG 
CCq^GGCGAA 
AGCTATTTAC 



CCGCATTGGT 
TGCAGCTAAA 
TGCCCAGCTT 



TTGAGAGGAA 60 

GCAGTGACTG 120 

ACTCACACAT 180 

ATCGCGGTCC 240 

CCCCGTGAAG 3 00 

ACCGAAGGCT 3 60 

AAAGTTAAAG 420 

GCCCGCGAGT 480 



GTGCGGAAAA CGGCATTACT 'DTTATTGGCC CAACCCCAGA GGTTCTTGAT CTCACCGGTG 540 

ATAAGTCTCG CGCGGTAACC GQCGCGAAGA AGGCTGGTCT GCCAGTTTTG GCGGAATCCA 600 

CCCCGAGCAA AAACATCGAT GAGATCGTTA AAAGCGCTGA AGGCCAGACT TACCCCATCT 660 

TTGTGAAGGC AGTTGCCGGT GGTGGCGGAC GCGGTATGCG TTTTGTTGCT TCACCTGATG 720 

AGCTTCGCAA ATTAGCAACA GAAGCATCTC GTGAAGCTGA AGCGGCTTTC GGCGATGGCG 780 

CGGTATATGT CGAACGTGCT GTGATTfiACC CTCAGCATAT TGAAGTGCAG ATCCTTGGCG 840 

ATCACACTGG AGAAGTTGTA CACCTTTJOTG AACGTGACTG CTCACTGCAG CGTCGTCACC 900 

AAAAAGTTGT CGAAATTGCG CCAGCACAGC ATTTGGATCC AGAACTGCGT GATCGCATTT 960 

GTGCGGATGC AGTAAAGTTC TGCCGCTCCAV TTGGTTACCA GGGCGCGGGA ACCGTGGAAT 102 0 

TCTTGGTCGA TGAAAAGGGC AACCACGTCT TCATCGAAAT GAACCCACGT ATCCAGGTTG 1080 

AGCACACCGT GACTGAAGAA GTCACCGAGG T^^ACCTGGT GAAGGCGCAG ATGCGCTTGG 1140 

CTGCTGGTGC AACCTTGAAG GAATTGGGTC TQ^CCAAGA TAAGATCAAG ACCCACGGTG 1200 

CAGCACTGCA GTGCCGCATC ACCACGGAAG ATCCAAACAA CGGCTTCCGC CCAGATACCG 1260 

GAACTATCAC CGCGTACCGC TCACCAGGCG GAGCTGGCGT TCGTCTTGAC GGTGCAGCTC 132 0 

AGCTCGGTGG CGAAATCACC GCACACTTTG ACTCCATGCT GGTGAAAATG ACCTGCCGTG 13 80 

GTTCCGACTT TGAAACTGCT GTTGCTCGTG CACAGCGCGC GTTGGCTGAG TTCACCGTGT 1440 

CTGGTGTTGC AACCAACATT GGTTTCTTGC GTGCGTTGCT GCGGGAAGAG GACTTCACTT 1500 

CCAAGCGCAT CGCCACCGGA TTCATTGCCG ATCACCCGcA CCTCCTTCAG GCTCCACCTG 1560 

CTGATGATGA GCAGGGACGC ATCCTGGATT ACTTGGCAGA TGTCACCGTG AACAAGCCTC 1620 

ATGGTGTGCG TCCAAAGGAT GTTGCAGCTC CTATCGATAA (^CTGCCTAAC ATCAAGGATC 1680 

TGCCACTGCC ACGCGGTTCC CGTGACCGCC TGAAGCAGCT TGGCCCAGCC GCGTTTGCTC 1740 

GTGATCTCCG TGAGCAGGAC GCACTGGCAG TTACTGATAC CACCTTCCGC GATGCACACC 1800 

AGTCTTTGCT TGCGACCCGA GTCCGCTCAT TCGCACTGAA GCCTGCGGCA GAGGCCGTCG 1860 

CAAAGCTGAC TCCTGAGCTT TTGTCCGTGG AGGCCTGGGG CGGcicGACC TACGATGTGG 1920 

CGATGCGTTT CCTCTTTGAG GATCCGTGGG ACAGGCTCGA CGAGCTGCGC GAGGCGATGC 1980 

CGAATGTAAA CATTCAGATG CTGCTTCGCG GCCGCAACAC CGTGGGATAC ACCCCGTACC 2040 

CAGACTCCGT CTGCCGCGCG TTTGTTAAGG AAGCTGCCAG CTCCGGcWg GACATCTTCC 2100 



GCATCTTCGA 


CGCGCTTAAC ' 


IGACGTCTCCC 


AGATGCGTCC 


AGCAATCGAC 


GCAGTCCTGG 


2160 


AGACCAACAC 


CGCGGTAGCC 


GAGGTGGCTA 


TGGCTTATTC 


TGGTGATCTC 


TCTGATCCAA 


2220 


ATGAAAAGCT 


CTACACCCTG 


gAttactacc 


TAAAGATGGC 


AGAGGAGATC 


GTCAAGTCTG 


2280 


GCGCTCACAT 


CTTGGCCATT 


AAGGATATGG 


CTGGTCTGCT 


TCGCCCAGCT 


GCGGTAACCA 


2340 


AGCTGGTCAC 


CGCACTGCGC 


CGTGAATTCG 


ATCTGCCAGT 


GCACGTGCAC 


ACCCACGACA 


2400 


CTGCGGGTGG 


CCAGCTGGCA 


ACCTACTTTG 


CTGCAGCTCA 


AGCTGGTGCA 


GATGCTGTTG 


2460 


ACGGTGCTTC 


CGCACCACTG 


TCTGGCACCA 


CCTCCCAGCC 


ATCCCTGTCT 


GCCATTGTTG 


2520 


CTGCATTCGC 


GCACACCCGT 


CGCGATAOCG 


GTTTGAGCCT 


CGAGGCTGTT 


TCTGACCTCG 


2580 


AGCCGTACTG 


GGAAGCAGTG 


CGCGGACTGT 


ACCTGCCATT 


TGAGTCTGGA 


ACCCCAGGCC 


2640 


CAACCGGTCG 


CGTCTACCGC 


CACGAAATCCV 


CAGGCGGACA 


GTTGTCCAAC 


CTGCGTGCAC 


2700 


AGGCCACCGC 


ACTGGGCCTT 


GCGGATCGTT 


TCGAACTCAT 


CGAAGACAAC 


TACGCAGCCG 


2760 


TTAATGAGAT 


GCTGGGACGC 


ccaaccaagg' 


"toaccccatc 


CTCCAAGGTT 


GTTGGCGACC 


2820 


TCGCACTCCA 


CCTCGTTGGT 


GCGGGTGTGG" 


"ATGCAGCAGA 


CTTTGCTGCC 


GATCCACAAA 


2880 


AGTACGACAT 


CCCAGACTCT 


GTCATCGCGT 


TCCTGCGCGG 


CGAGCTTGGT 


AACCCTCCAG 


2940 


GTGGCTGGCC 


AGAGCCACTG 


CGCACCCGCG 


CACTGGAAGG 


CCGCTCCGAA 


GGCAAGGCAC 


3000 


CTCTGACGGA 


AGTTCCTGAG 


GAAGAGCAGG 


CGCACGTCGA 


CGCTGATGAT 


TCCAAGGAAC 


3060 


GTCGCAATAG 


CCTCAACCGC 


CTGCTGTTCC 


CGAAGCGAAC 


CGAAGAGTTC 


CTCGAGCACC 


3120 


GTCGCCGCTT 


CGGCAACACC 


TCTGCGCTGG 


ATGATCGTGA 


ATTCTTCTAC 


GGCCTGGTCG 


3180 


AAGGCCGCGA 


GACTTTGATC 


CGCCTGCCAG 


ATGTGCGCAC 


CCCACTGCTT 


GTTCGCCTGG 


3240 


ATGCGATCTC 


TGAGCCAGAC 


GATAAGGGTA 


TGCGCAATGT 


TGTGGCCAAC 


GTCAACGGCC 


3300 


AGATCCGCCC 


AATGCGTGTG 


CGTGACCGCT 


CCGTTGAGTC 


\tgtcaccgca 


ACCGCAGAAA 


3360 


AGGCAGATTC 


CTCCAACAAG 


GGCCATGTTG 


CTGCACCATT 


CGCTGGTGTT 


GTCACCGTGA 


3420 


CTGTTGCTGA 


AGGTGATGAG 


GTCAAGGCTG 


GAGATGCAGT 


CSCAATCATC 


GAGGCTATGA 


3480 


AGATGGAAGC 


AACAATCACT 


GCTTCTGTTG 


ACGGCAAAAT 


CGATCGCGTT 


GTGGTTCCTG 


3540 


CTGCAACGAA 


GGTGGAAGGT 


GGCGACTTGA 


TCGTCGTCGT 


TTCCTAAACC 


TTTCTGTAAA 


3600 


AAGCCCCGCG 


TCTTCCTCAT 


GGAGGAGGCG 


GGGCTTTTTG 


GGCOAAGATG 


GGAGATGGGT 


3660 


GAGTTGGATT 


TGGTCTGATT 


CGACACTTTT 


AAGGGCAGAG 


ATTTGAAGAT 


GGAGACCAAG 


3720 



GCTCAAAG 



372 



(2) SEQ ID NO: 2: 

(i) SEQUENCE CHAR^TERISTICS : 

(A) LENGTH: 1140 Amino Acids 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS : Single Strand 

(D) TOPOLOGY: lUnear 

(ii)TYPE OF MOLECULE : \Protein 



(xi) SEQ ID NO: 2: 

Met Ser Thr His Thr Ser\ser Thr Leu Pro Ala Phe Lys Lys He Leu 
1 5 \ 10 15 

Val Ala Asn Arg Gly Glu iVe Ala Val Arg Ala Phe Arg Ala Ala Leu 
20 \ 25 30 

Glu Thr Gly Ala Ala Thr Val Ala He Tyr Pro Arg Glu Asp Arg Gly 
35 \40 45 

Ser Phe His Arg Ser Phe Ala-^er Glu Ala Val Arg He Gly Thr Glu 
50 55 JV> 60 

Glv Ser Pro Val Lys Ala Tyr L^u Asp He Asp Glu He He Gly Ala 
65 70 \ 75 80 

Ala Lys Lys Val Lys Ala Asp Ala\lle Tyr Pro Gly Tyr Gly Phe Leu 

85 \ 90 95 

Ser Glu Asn Ala Gin Leu Ala Arg g\u Cys Ala Glu Asn Gly He Thr 
100 10^5 110 

Phe He Gly Pro Thr Pro Glu Val LeuXAsp Leu Thr Gly Asp Lys Ser 
115 120 \ 125 

Arg Ala Val Thr Ala Ala Lys Lys Ala Oly Leu Pro Val Leu Ala Glu 
130 135 \ 140 

Ser Thr Pro Ser Lys Asn He Asp Glu IlA Val Lys Ser Ala Glu Gly 
145 150 \l55 160 

Gin Thr Tyr Pro He Phe Val Lys Ala Val Ala Gly Gly Gly Gly Arg 

165 170 \ 175 

Gly Met Arg Phe Val Ala Ser Pro Asp Glu Le\i Arg Lys Leu Ala Thr 
180 185 \ 190 



Glu 



Ala Ser Arg Glu Ala Glu Ala Ala Phe GlyVAsp Gly Ala Val Tyr 



195 



200 



205 



Val Glu Arg Ala \ICal lie Asn Pro Gin His lie Glu Val Gin lie Leu 
210 \ 215 220 

Gly Asp His Thr Gl^ C3lu Val Val His Leu Tyr Glu Arg Asp Cys Ser 
225 \230 235 240 



Leu Gin Arg Arg His 

245 



iXn Lys Val Val Glu lie Ala Pro Ala Gin His 

250 255 



Leu Asp Pro Glu Leu Arg\Asp Arg lie Cys Ala Asp Ala Val Lys Phe 
260 \ 265 270 

Cys Arg Ser lie Gly Tyr Gin Gly Ala Gly Thr Val Glu Phe Leu Val 
275 \ 280 285 

Asp Glu Lys Gly Asn His Val\Phe He Glu Met Asn Pro Arg He Gin 
290 295 \ 300 

Val Glu His Thr Val Thr Glu gIu Val Thr Glu Val Asp Leu Val Lys 
305 310 \ 315 320 

Ala Gin Met Arg Leu Ala Ala Gly^Ala Thr Leu Lys Glu Leu Gly Leu 

325 IV) 330 335 

Thr Gin Asp Lys He Lys Thr His dly Ala Ala Leu Gin Cys Arg He 
340 34v5 350 

Thr Thr Glu Asp Pro Asn Asn Gly Phe\ Arg Pro Asp Thr Gly Thr He 
355 360 \ 365 

Thr Ala Tyr Arg Ser Pro Gly Gly Ala dily Val Arg Leu Asp Gly Ala 
370 375 \ 380 

Ala Gin Leu Gly Gly Glu He Thr Ala Kite Phe Asp Ser Met Leu Val 
385 390 \ 395 400 

Lys Met Thr Cys Arg Gly Ser Asp Phe Glu Vrhr Ala Val Ala Arg Ala 

405 410 \ 415 

Gin Arg Ala Leu Ala Glu Phe Thr Val Ser gW Val Ala Thr Asn He 
420 425 \ 430 

Gly Phe Leu Arg Ala Leu Leu Arg Glu Glu AsA Phe Thr Ser Lys Arg 
435 440 \ 445 

He Ala Thr Gly Phe He Ala Asp His Pro His iLeu Leu Gin Ala Pro 
450 455 'i460 

Pro Ala Asp Asp Glu Gin Gly Arg He Leu Asp i'yr Leu Ala Asp Val 
465 470 475 \ 480 



Thr Val Asn Lys Pro His Gly Val Arg Pro Lys ASp Val Ala Ala Pro 



490 



495 



lie Asp Lys Leu Pro Asn lie Lys Asp Leu Pro Leu Pro Arg Gly Ser 
500 \ 505 510 



Arg Asp Arg Leu Lys 
515 



Leu Gly Pro Ala Ala Phe Ala Arg Asp Leu 
520 525 



Arg Glu Gin Asp Ala 
530 



Ala Val Thr Asp Thr Thr Phe Arg Asp Ala 
535 540 



His Gin Ser Leu Leu Ala Thr Arg Val Arg Ser Phe Ala Leu Lys Pro 
545 550 \ 555 560 

Ala Ala Glu Ala Val Ala Lys Lys Thr Pro Glu Leu Leu Ser Val Glu 

565 \ 570 575 

Ala Trp Gly Gly Ala Thr Tyr Asp Val Ala Met Arg Phe Leu Phe Glu 
580 \ 585 590 

Asp Pro Trp Asp Arg Leu Asp Glu Leu Arg Glu Ala Met Pro Asn Val 
595 60iA 605 

Asn lie Gin Met Leu Leu Arg Gl^^rg Asn Thr Val Gly Tyr Thr Pro 
610 615 \ 620 

Tyr Pro Asp Ser Val Cys Arg Ala Phe Val Lys Glu Ala Ala Ser Ser 
625 630 \ 635 640 



Gly Val Asp lie Phe Arg lie Phe Asp Ala Leu Asn Asp Val Ser Gin 

645 6^50 655 



5^^0 



Met Arg Pro Ala lie Asp Ala Val Leu Glu Thr Asn Thr Ala Val Ala 
660 665 \ 670 



Glu Val Ala Met Ala Tyr Ser Gly Asp Leu Ser Asp Pro Asn Glu Lys 
675 680 V 685 

Leu Tyr Thr Leu Asp Tyr Tyr Leu Lys Met Ala Glu Glu lie Val Lys 
690 695 \ 700 

Ser Gly Ala His lie Leu Ala lie Lys Asp Met yCaa Gly Leu Leu Arg 
705 710 715 \ 720 

Pro Ala Ala Val Thr Lys Leu Val Thr Ala Leu Arg Arg Glu Phe Asp 

725 730 \ 735 

Leu Pro Val His Val His Thr His Asp Thr Ala Glif Gly Gin Leu Ala 
740 745 \ 750 

Thr Tyr Phe Ala Ala Ala Gin Ala Gly Ala Asp AlaWal Asp Gly Ala 
755 760 \765 



Ser Ala Pro Leu Ser Gly Thr Thr Ser Gin Pro Ser Leu Ser Ala lie 



770 



775 



780 



Val Ala Ala Phe AlaXHis Thr Arg Arg Asp Thr Gly Leu Ser Leu Glu 
785 \790 795 800 

Ala Val Ser Asp Leu dlu Pro Tyr Trp Glu Ala Val Arg Gly Leu Tyr 

805 \ 810 815 

Leu Pro Phe Glu Ser GlV Thr Pro Gly Pro Thr Gly Arg Val Tyr Arg 
820 \ 825 830 

His Glu lie Pro Gly Gly \31n Leu Ser Asn Leu Arg Ala Gin Ala Thr 
835 \ 840 845 

Ala Leu Gly Leu Ala Asp A:^ Phe Glu Leu lie Glu Asp Asn Tyr Ala 
850 855 860 

Ala Val Asn Glu Met Leu Gly\Arg Pro Thr Lys Val Thr Pro Ser Ser 
865 870 \ 875 880 

Lys Val Val Gly Asp Leu Ala lW His Leu Val Gly Ala Gly Val Asp 

885 \ 890 895 



Pro Ala Asp Phe Ala Ala Asp 
900 



^Gln Lys Tyr Asp lie Pro Asp Ser 
105 910 



Val lie Ala Phe Leu Arg Gly Glu Leu Gly Asn Pro Pro Gly Gly Trp 
915 920 \ 925 

Pro Glu Pro Leu Arg Thr Arg Ala L6u Glu Gly Arg Ser Glu Gly Lys 
930 935 \ 940 



Ala Pro Leu Thr Glu Val Pro Glu 
945 950 



Glu\ Glu Gin Ala His Leu Asp Ala 
955 960 



Asp Asp Ser Lys Glu Arg Arg Asn Ser ^eu Asn Arg Leu Leu Phe Pro 

965 970 975 

Lys Pro Thr Glu Glu Phe Leu Glu His ArW Arg Arg Phe Gly Asn Thr 
980 985 T 990 

Ser Ala Leu Asp Asp Arg Glu Phe Phe Tyr\Gly Leu Val Glu Gly Arg 
995 1000 \ 1005 

Glu Thr Leu lie Arg Leu Pro Asp Val Arg "thr Pro Leu Leu Val Arg 
1010 1015 \ 1020 

Leu Asp Ala lie Ser Glu Pro Asp Asp Lys Gly Met Arg Asn Val Val 
1025 1030 1Q35 1040 

Ala Asn Val Asn Gly Gin lie Arg Pro Met ArW Val Arg Asp Arg Ser 

1045 1050 T 1055 



Val Glu Ser Val Thr Ala Thr Ala Glu Lys Alal Asp Ser Ser Asn Lys 



1060 \ 1065 1070 

Gly His Val Ala Ala Pro Plie Ala Gly Val Val Thr Val Thr Val Ala 
1075 \ 1080 1085 

Glu Gly Asp Glu Val Lys AlaXGly Asp Ala Val Ala lie lie Glu Ala 
1090 109*^ 1100 



Met Lys Met Glu Ala Thr Ile-<fhr Ala Ser Val Asp Gly Lys lie Asp 
\^ 1105 1110 \ 1115 1120 

Arg Val Val Val Pro Ala Ala THr Lys Val Glu Gly Gly Asp Leu lie 

1125 \ 1130 1135 

Val Val Val Ser \ 
1140 \ 




